SOME ASPECTS OF INFERENCE FOR

MULTIVARIATE INFINITELY DIVISIBLE DISTRIBUTIONS


Stanley  L.  Sclove 


ABSTRACT 


Measurement of dependence in the infinitely divisible class of
multivariate distributions, based on developments in probability theory
for that class, is discussed. It has been shown that pairwise
independence is equivalent to mutual independence in this class. When the
infinitely divisible variables contain no normal component (in
particular, when they are discrete), the cumulant of order (2,2) can be
used as a measure of pairwise dependence; when a normal component is
present, the appropriate measure also involves the covariance. Results
for testing independence of infinitely divisible random variables are
discussed. A method of testing normality against infinitely divisible
alternatives is given.

1. INTRODUCTION

A random variable (r.v.) $X$ is infinitely divisible (inf.div.) if
there exists a triangular sequence $X_{nj}$, $n = 1, 2, \ldots$,
$j = 1, 2, \ldots, n$, such that, for each $n = 1, 2, \ldots$, the $n$
r.v.'s $X_{nj}$, $j = 1, 2, \ldots, n$, are independent and identically
distributed and the variables $X^{(n)}$ defined by
$X^{(n)} = X_{n1} + X_{n2} + \cdots + X_{nn}$, $n = 1, 2, \ldots$, all
have the same distribution as $X$. The condition in terms of the
characteristic function (c.f.) $\phi(u)$ of the r.v. $X$ is that, for
each $n$, there exist a c.f. $\phi_n(u)$ such that
$\phi(u) = [\phi_n(u)]^n$. The notion of infinite divisibility applies
whenever a notion of addition is defined; e.g., the variable $X$ may be a
vector or a matrix.
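
The c.f. condition can be illustrated numerically. The sketch below is only an illustration under stated assumptions: it takes the Poisson law as the example (the function names `poisson_cf` and `divisibility_gap` are hypothetical, not from the paper) and checks on a grid that the c.f. of a Poisson variable with mean lam equals the n-th power of the c.f. of a Poisson variable with mean lam/n.

```python
import numpy as np

def poisson_cf(u, lam):
    """Characteristic function of a Poisson(lam) variable at real u."""
    return np.exp(lam * (np.exp(1j * u) - 1.0))

def divisibility_gap(lam, n, grid):
    """Largest |phi(u) - phi_n(u)**n| over the grid, where phi is the c.f.
    of Poisson(lam) and phi_n is the c.f. of Poisson(lam / n)."""
    phi = poisson_cf(grid, lam)
    phi_n = poisson_cf(grid, lam / n)
    return float(np.max(np.abs(phi - phi_n ** n)))

# Illustrative values only: mean 3 and a few choices of n.
u_grid = np.linspace(-10.0, 10.0, 401)
gaps = {n: divisibility_gap(3.0, n, u_grid) for n in (1, 2, 5, 50)}
```

The gaps are zero up to floating-point rounding for every n, which is the defining property; a law that is not inf.div. would fail to admit such a c.f. root for some n.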


AMS  classification:  62H15,  62H20 

Key words and phrases: infinite divisibility; multivariate normality;
kurtosis; measures of dependence

The marginal distributions of an inf.div. multivariate distribution
are inf.div.; i.e., if a random vector (r.vec.)
$X = (X_1, X_2, \ldots, X_p)'$ is inf.div., then each variable $X_v$ is
inf.div., $v = 1, 2, \ldots, p$. Also, if each variable of a r.vec. is
inf.div. and the variables are independent, then the r.vec. is inf.div.
But the elements of inf.div. r.vec.'s can be dependent; so the class of
multivariate inf.div. distributions is quite broad. In particular, the
class is closed under affine transformations.

Using the canonical representation of the c.f. of inf.div. r.vec.'s
(see, e.g., [12] and the discussion below), a necessary and sufficient
condition for mutual independence of the elements of the r.vec. was
obtained in [9]. When the means are zero, this condition is simply that
the squares of the variables be uncorrelated. Thus, for inf.div.
r.vec.'s, not only does mutual independence reduce to pairwise
independence but also the parametric characterization of dependence is
simple. In the present paper some methods of statistical inference based
on this advance in probability theory are developed.

It is not too much of an over-simplification to say that the
applicability and relative ease of implementation of procedures derived
from multivariate normal distributions depend upon the propriety of the
correlation coefficient as the measure of dependence between variables
which are jointly normally distributed. As will be discussed below, the
development in [9] includes a measure of dependence for the variables of
inf.div. r.vec.'s. The class of multivariate inf.div. distributions
includes the multivariate normal family as well as other continuous
multivariate distributions. Further, it includes discrete multivariate
distributions, useful for modeling data such as those generated by
multivariate point processes. Finally, and perhaps most importantly, the
class includes r.vec.'s which are mixed in the sense that some variables
are continuous and others are discrete. Here are two examples of sources
of such data. One is in the observation of physical systems where one
considers continuous measurements of energy, phase, angular momentum,
together with counts of numbers of collisions, disintegrations, etc.
Another source is in the observation of biomedical systems where one
considers continuous measurements of blood pressures, pulse rates,
chemical concentrations, etc., together with counts of red and white
blood cells. The measures of dependence stemming from [9] provide a
method for the systematic treatment of dependence among the variables of
such mixed r.vec.'s. These measures of dependence are potentially of
great importance and applicability, as they play a role analogous to that
of correlation coefficients.

Multivariate inference problems assuming inf.div. distributions are
relatively tractable. Results demonstrating this are given below.
Sections 2 and 3 present some results for inf.div. probability laws; the
results and the proofs are at least implicit in [9]. These results form
the basis for the remarks on inference of Section 4.

Now suppose $X$ is an inf.div. r.vec. and let $\phi$ be its c.f. The
Kolmogorov representation for the logarithm $\psi(u)$ of $\phi(u)$ is

(1.1)  $\psi(u) = iu'\mu - u'\Gamma u/2 + \int [\exp(iu'x) - 1 - iu'x]\,(x'x)^{-1}\,M(dx)$,

where $M$ is a bounded measure having no mass at the origin and $\Gamma$
is a positive definite matrix. Let $U \cong V$ mean that $U$ and $V$ have
the same distribution. The representation (1.1) means that

(1.2)  $X \cong X_G + X_P$,

where $X_G$ and $X_P$ are independent, the component $X_G$ is of Gaussian
type, i.e., has a multivariate normal distribution, and the component
$X_P$ is of Poisson type, i.e., has log c.f. equal to the integral in
(1.1).

2.  TWO  VARIABLES 

Now consider two jointly inf.div. r.v.'s $X$ and $Y$, i.e., let
$(X,Y)'$ be an inf.div. r.vec.

2.1. Characteristic function

The Kolmogorov representation (1.1) holds in the case of finite
variances and for two variables is of the form

(2.1)  $\psi(t,u) = it\mu_1 + iu\mu_2 - t^2\gamma_{11}/2 - tu\gamma_{12} - u^2\gamma_{22}/2$
       $\quad + \int \{\exp[i(tx + uy)] - 1 - i(tx + uy)\}\,(x^2 + y^2)^{-1}\,M(dx,dy)$.


2.2. Cumulants

The cumulants $\kappa_{rs}$, when they exist, are defined by the
expansion

$\psi(t,u) = \sum_r \sum_s \kappa_{rs}\, i^{r+s}\, t^r u^s/(r!\,s!)$.

We refer to $\kappa_{rs}$ as the cumulant of order $(r,s)$. The cumulants
are given by

$\kappa_{20} = \mathrm{Var}(X) = \gamma_{11} + \int x^2 (x^2 + y^2)^{-1} M(dx,dy)$,
$\kappa_{02} = \mathrm{Var}(Y) = \gamma_{22} + \int y^2 (x^2 + y^2)^{-1} M(dx,dy)$,
$\kappa_{11} = \mathrm{Cov}(X,Y) = \gamma_{12} + \int xy\, (x^2 + y^2)^{-1} M(dx,dy)$,

and, for $r + s \ge 3$,

(2.2)  $\kappa_{rs} = \int x^r y^s (x^2 + y^2)^{-1} M(dx,dy)$.

(These formulas are readily obtained: When the cumulant of order $(r,s)$
exists, it is, apart from the factor $i^{r+s}$, the mixed partial
derivative of $\psi$ of order $(r,s)$, evaluated at $(t,u) = (0,0)$; in
this case, the differentiation may be passed under the integral sign.)

If $(X,Y)'$ has no Gaussian component, then the cumulants are given by

$\kappa_{10} = \mu_1$, $\kappa_{01} = \mu_2$,

and, for $r + s \ge 2$,

$\kappa_{rs} = \int x^r y^s (x^2 + y^2)^{-1} M(dx,dy)$.

In the theory of multivariate inf.div. distributions the cumulant of
order (2,2), $\kappa_{22}$, plays a special role; following [9], denote
this functional by $\pi(X,Y)$.

As is well known [1, p. 39], if $(X,Y)'$ is bivariate normal, then
$\kappa_{rs} = 0$ if $r + s > 2$; in particular, then, $\pi(X,Y) = 0$ if
$(X,Y)'$ is bivariate normal.

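Now suppose that a r.vec. $(U,V)'$ has the property that
$(U,V) \cong (U_1,V_1) + \cdots + (U_n,V_n)$, where the pairs
$(U_1,V_1), (U_2,V_2), \ldots, (U_n,V_n)$ are independent. Then
(introducing obvious notation)

$\sum_r \sum_s \kappa_{rs}(U,V)\, i^{r+s} t^r u^s/(r!\,s!) = \psi_{U,V}(t,u) = \log E \exp[i(tU + uV)]$
$\quad = \log E \exp\{i(t \sum_j U_j + u \sum_j V_j)\} = \log E \prod_j \exp[i(tU_j + uV_j)]$
$\quad = \log \prod_j E \exp[i(tU_j + uV_j)] = \log \prod_j \phi_{U_j,V_j}(t,u)$
$\quad = \sum_j \log \phi_{U_j,V_j}(t,u) = \sum_j \psi_{U_j,V_j}(t,u)$
$\quad = \sum_j \sum_r \sum_s \kappa_{rs}(U_j,V_j)\, i^{r+s} t^r u^s/(r!\,s!)$,

so that

$\kappa_{rs}(U,V) = \sum_j \kappa_{rs}(U_j,V_j)$.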

Now, $(X,Y) \cong (X_G,Y_G) + (X_P,Y_P)$, where the summands are
independent, so, if the fourth moments are finite, so that $\pi(X,Y)$
exists,

$\pi(X,Y) = \kappa_{22}(X,Y)$
$\quad = \kappa_{22}(X_G,Y_G) + \kappa_{22}(X_P,Y_P)$
$\quad = 0 + \pi(X_P,Y_P) = \pi(X_P,Y_P)$.

Note also that, if $(X,Y)'$ is inf.div. and the fourth moments are
finite, then

(2.3)  $\pi(X,Y) = \kappa_{22}(X,Y) = \psi_{ttuu}(0,0) = \int x^2 y^2 (x^2 + y^2)^{-1} M(dx,dy) \ge 0$.

This proves

THEOREM 2.1. If $(X,Y)'$ is inf.div. with finite fourth moments,
then $\pi(X,Y) \ge 0$.

2.3.  Independence 


Now let us examine the relationship between independence of jointly
inf.div. r.v.'s $X$, $Y$ and nullity of the functionals
$\mathrm{Cov}(X,Y)$ and $\pi(X,Y)$.

Note from (2.3) that $\pi(X,Y) = 0$ implies that $M$ puts mass only on
the axes of the $(x,y)$-plane, i.e., $X_P$ and $Y_P$ are independent.
This proves

THEOREM 2.2. Let $(X,Y)'$ be inf.div. with finite fourth moments
and no Gaussian component. Then $X$ and $Y$ are independent if and only
if (iff) $\pi(X,Y) = 0$.

In particular, discrete r.vec.'s can have no Gaussian component.
Hence we have

COROLLARY 2.1. If $(X,Y)'$ is inf.div. with finite fourth moments
and discrete, then $X$ and $Y$ are independent iff $\pi(X,Y) = 0$.
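
To make the role of the measure $M$ concrete, here is a minimal sketch that evaluates the integral formula for the cumulants when $M$ has finitely many atoms and there is no Gaussian component. The common-shock Poisson pair used as the example, and the names `cumulant_from_atoms` and `common_shock_atoms`, are assumptions introduced for illustration; they do not come from the paper. The sketch relies on the standard fact that a jump of size $(x,y)$ occurring at a given rate contributes mass rate times $(x^2 + y^2)$ to $M$.

```python
from math import isclose

def cumulant_from_atoms(atoms, r, s):
    """kappa_rs = sum_k m_k * x_k**r * y_k**s / (x_k**2 + y_k**2) for a
    measure M with mass m_k at (x_k, y_k); intended for r + s >= 2 when
    there is no Gaussian component."""
    return sum(m * x**r * y**s / (x * x + y * y) for (x, y, m) in atoms)

def common_shock_atoms(a, b, c):
    """Hypothetical example: X = A + C, Y = B + C with independent Poisson
    A, B, C of rates a, b, c. Jumps (1,0), (0,1), (1,1) occur at those
    rates, and M weights each jump point by rate * (x**2 + y**2)."""
    jumps = [((1.0, 0.0), a), ((0.0, 1.0), b), ((1.0, 1.0), c)]
    return [(x, y, rate * (x * x + y * y)) for (x, y), rate in jumps if rate > 0]

dependent = common_shock_atoms(2.0, 3.0, 0.5)     # M has mass off the axes
independent = common_shock_atoms(2.0, 3.0, 0.0)   # M has mass only on the axes

pi_dep = cumulant_from_atoms(dependent, 2, 2)
pi_ind = cumulant_from_atoms(independent, 2, 2)
var_x = cumulant_from_atoms(dependent, 2, 0)
cov_xy = cumulant_from_atoms(dependent, 1, 1)
```

In this example the (2,2) cumulant equals the rate c of the shared component, so it vanishes exactly when the shared component is absent and the measure sits on the axes.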

THEOREM 2.3. Let $(X,Y)'$ be an inf.div. r.vec. Then $X$ and $Y$ are
independent if $\mathrm{Cov}(X,Y) = 0$ and $\pi(X,Y) = 0$.

PROOF. We have $(X,Y) \cong (X_G,Y_G) + (X_P,Y_P)$, where $(X_G,Y_G)$
and $(X_P,Y_P)$ are independent. To show independence of $X$ and $Y$, it
suffices to show $X_P$, $Y_P$ independent and $X_G$, $Y_G$ independent.
Now,

(i)  $0 = \pi(X,Y) = \kappa_{22}(X,Y) = \kappa_{22}(X_G,Y_G) + \kappa_{22}(X_P,Y_P) = 0 + \kappa_{22}(X_P,Y_P)$,

i.e., $\pi(X_P,Y_P) = 0$, which by Theorem 2.2 implies that $X_P$ and
$Y_P$ are independent. Then,

(ii)  $0 = \mathrm{Cov}(X,Y)$
      $= \mathrm{Cov}(X_G + X_P,\, Y_G + Y_P)$
      $= \mathrm{Cov}(X_G,Y_G) + \mathrm{Cov}(X_P,Y_P) + \mathrm{Cov}(X_G,Y_P) + \mathrm{Cov}(X_P,Y_G)$
      $= \mathrm{Cov}(X_G,Y_G) + \mathrm{Cov}(X_P,Y_P) + 0 + 0$  (by independence of $(X_G,Y_G)$ and $(X_P,Y_P)$)
      $= \mathrm{Cov}(X_G,Y_G) + 0$  (by (i))
      $= \mathrm{Cov}(X_G,Y_G)$.

Therefore, $\mathrm{Cov}(X_G,Y_G) = 0$, and $X_G$ and $Y_G$ are
independent because they are jointly normally distributed.


If $X$ and $Y$ are independent, then $\mathrm{Cov}(X,Y) = 0$ and also
the cross-cumulants are null; in particular, $\pi(X,Y) = 0$. Together
with Theorem 2.3 this gives

COROLLARY 2.2. Let $(X,Y)'$ be an inf.div. r.vec. Then $X$ and $Y$ are
independent iff $\mathrm{Cov}(X,Y) = 0$ and $\pi(X,Y) = 0$.

Now,

(2.4)  $\mathrm{Cov}[(X-EX)^2, (Y-EY)^2] = 2[\mathrm{Cov}(X,Y)]^2 + \kappa_{22}(X,Y)$.

Write (2.4) as $\tau(X,Y) = \nu(X,Y) + \pi(X,Y)$, where
$\tau(X,Y) = \mathrm{Cov}[(X-EX)^2, (Y-EY)^2]$ and
$\nu(X,Y) = 2[\mathrm{Cov}(X,Y)]^2$.

Here one may think of $\tau$ (or $T$) as "total," $\nu$ (or $N$) as
"normal," and $\pi$ (or $\Pi$) as "Poisson-type." Thus we have

LEMMA 2.1. Let $(X,Y)'$ be an inf.div. r.vec. Then
$\mathrm{Cov}(X,Y) = 0$ and $\pi(X,Y) = 0$ implies
$\mathrm{Cov}[(X-EX)^2, (Y-EY)^2] = 0$.

LEMMA 2.2. Let $(X,Y)'$ be an inf.div. r.vec. Then
$\mathrm{Cov}[(X-EX)^2, (Y-EY)^2] = 0$ implies $\mathrm{Cov}(X,Y) = 0$
and $\pi(X,Y) = 0$.

PROOF. For inf.div. r.vec.'s, $\pi(X,Y) \ge 0$. Hence
$0 = \mathrm{Cov}[(X-EX)^2, (Y-EY)^2] = 2[\mathrm{Cov}(X,Y)]^2 + \pi(X,Y)$
implies $\mathrm{Cov}(X,Y) = 0$ and $\pi(X,Y) = 0$.

COROLLARY 2.3. Let $(X,Y)'$ be an inf.div. r.vec. Then nullity of
both $\mathrm{Cov}(X,Y)$ and $\pi(X,Y)$ is equivalent to nullity of
$\mathrm{Cov}[(X-EX)^2, (Y-EY)^2]$.

THEOREM 2.4. Let $(X,Y)'$ be an inf.div. r.vec. Then $X$ and $Y$ are
independent iff $\mathrm{Cov}[(X-EX)^2, (Y-EY)^2] = 0$.

PROOF. If $(X,Y)'$ is inf.div., then $X$ and $Y$ are independent iff
$\mathrm{Cov}(X,Y) = 0$ and $\pi(X,Y) = 0$, and this holds iff
$\mathrm{Cov}[(X-EX)^2, (Y-EY)^2] = 0$.

If we know that $(X,Y)'$ is bivariate normal, then $X$ and $Y$ are
independent iff $\mathrm{Cov}(X,Y) = 0$. If we know only that $(X,Y)'$ is
inf.div. (and not necessarily bivariate normal), then we can be assured
of independence of $X$ and $Y$ only if $\pi(X,Y) = 0$ as well as
$\mathrm{Cov}(X,Y) = 0$.
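
The decomposition (2.4) can be checked on simulated data. The sketch below is an illustration under assumptions, not a procedure from the paper: it uses simple plug-in sample moments (not the unbiased k-statistics), a hypothetical function name `dependence_parts`, and a simulated common-shock Poisson pair whose population values are Cov = 0.5 and (2,2) cumulant 0.5.

```python
import numpy as np

def dependence_parts(x, y):
    """Plug-in (moment) estimates of tau, nu and pi for paired samples.
    These are simple sample analogs, not unbiased k-statistics."""
    dx = np.asarray(x, dtype=float) - np.mean(x)
    dy = np.asarray(y, dtype=float) - np.mean(y)
    m11 = np.mean(dx * dy)
    m20, m02 = np.mean(dx**2), np.mean(dy**2)
    m22 = np.mean(dx**2 * dy**2)
    tau = m22 - m20 * m02                 # Cov[(X-EX)^2, (Y-EY)^2]
    nu = 2.0 * m11**2                     # 2 [Cov(X,Y)]^2
    pi = m22 - m20 * m02 - 2.0 * m11**2   # kappa_22 via central moments
    return tau, nu, pi

# Hypothetical data: X = A + C, Y = B + C with independent Poisson parts.
rng = np.random.default_rng(0)
size = 200_000
a = rng.poisson(2.0, size)
b = rng.poisson(3.0, size)
c = rng.poisson(0.5, size)
tau, nu, pi = dependence_parts(a + c, b + c)
```

By construction the three estimates satisfy tau = nu + pi exactly, mirroring (2.4); the individual values are subject to sampling error and to the bias of plug-in moments.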

2.4.  Measures  of  dependence 

Note that $\tau(X,Y)$ provides a measure of dependence between $X$ and
$Y$, in the sense that the correlation

$\mathrm{Corr}[(X-EX)^2, (Y-EY)^2] = \mathrm{Cov}[(X-EX)^2, (Y-EY)^2]/\{\mathrm{Var}[(X-EX)^2]\,\mathrm{Var}[(Y-EY)^2]\}^{1/2}$

is at most one in absolute value. The meaning of a value of one is that
there exist constants $a$ and $b$ such that $(Y-EY)^2 = a + b(X-EX)^2$
with probability one.

Similarly, $\pi(X,Y)$ provides a measure of dependence between $X$ and
$Y$ when $(X,Y)'$ is inf.div., in the sense that [9]

(2.5)  $\pi(X,Y)/[\pi(X,X)\,\pi(Y,Y)]^{1/2}$

is at most one (and at least zero, since $\pi(X,Y) \ge 0$ if $(X,Y)'$ is
inf.div.). The meaning of a value of one for (2.5) when $(X,Y)'$ has no
normal component is [9] that there exist independent r.v.'s $U$ and $V$
and a constant $r$ such that $X - EX = U + V$, $Y - EY = r(U - V)$, where
$r = [\pi(Y,Y)/\pi(X,X)]^{1/4}$. If $(X,Y)'$ has a normal component,
then, when (2.5) has a value of one, $(X - EX,\, Y - EY)$ is distributed
as $(Z_1 + U + V,\; Z_2 + r(U - V))$, where $(Z_1, Z_2)$ is bivariate
normal.
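
As a worked check that the representation above does give the value one in (2.5), the following short derivation assumes that $U$ and $V$ are independent with finite fourth moments, that $K = \kappa_4(U) + \kappa_4(V) > 0$, and that $r > 0$. It uses the multilinearity of joint cumulants and the vanishing of mixed cumulants of independent variables, which are standard properties not stated in the text.

```latex
\begin{align*}
\pi(X,X) &= \kappa_4(U+V) = \kappa_4(U) + \kappa_4(V) = K,\\
\pi(Y,Y) &= \kappa_4\bigl(r(U-V)\bigr) = r^4\,[\kappa_4(U) + \kappa_4(V)] = r^4 K,\\
\pi(X,Y) &= \kappa_{22}\bigl(U+V,\; r(U-V)\bigr) = r^2\,[\kappa_4(U) + \kappa_4(V)] = r^2 K,\\
\frac{\pi(X,Y)}{[\pi(X,X)\,\pi(Y,Y)]^{1/2}} &= \frac{r^2 K}{(K \cdot r^4 K)^{1/2}} = 1 .
\end{align*}
```

The second line also recovers $r = [\pi(Y,Y)/\pi(X,X)]^{1/4}$, in agreement with the constant given in the text.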

3.  SEVERAL  VARIABLES 

THEOREM 3.1. Let $X = (X_1, X_2, \ldots, X_p)'$ be an inf.div. r.vec.

(a) The variates of the Poisson component of $X$ are independent iff

(3.1)  $\pi(X_u, X_v) = 0$ for all $u, v$ such that $u \ne v$.

(b) The variates of $X$ are independent iff $\pi(X_u, X_v) = 0$ and
$\nu(X_u, X_v) = 0$ for all $u, v$ such that $u \ne v$.

(c) The variates of $X$ are independent iff $\tau(X_u, X_v) = 0$ for all
$u, v$ such that $u \ne v$.

REMARKS. (i) Note that, in particular, pairwise independence
implies mutual independence in the multivariate inf.div. class. (See
also Theorem 3.2 below.) (ii) The proof given below is essentially the
same as that in [9] but is given here because it is important for the
understanding of Remark (i), which in turn is important for the
development of inference for multivariate inf.div. distributions.

PROOF. (a) The Kolmogorov representation for the log c.f. in the
case of finite variances is (1.1). If $X_v$, $v = 1, 2, \ldots, p$, are
independent, then, in particular, $X_u$ and $X_v$ are independent, and
cross-cumulants are zero; in particular,
$\kappa_{22}(X_u, X_v) = \pi(X_u, X_v) = 0$. If, conversely,
$\pi(X_u, X_v) = 0$ for all $u, v$ such that $u \ne v$, then we have

$0 = \sum_{u \ne v} \pi(X_u, X_v) = \int \sum_{u \ne v} x_u^2 x_v^2\, (x'x)^{-1}\, M(dx)$.

This implies that

(3.2)  $\sum_{u \ne v} x_u^2 x_v^2 = 0$

at points $(x_1, x_2, \ldots, x_p)$ where $M$ has mass. But (3.2) is true
only for points $(x_1, x_2, \ldots, x_p)$ with at most one non-zero
co-ordinate, i.e., only on the axes of $R^p$. Therefore, $M$ can have
mass only on the axes. Hence, $M = M_1 + \cdots + M_p$, where $M_v$ has
mass only on the $x_v$-axis. Let $h(u_1, u_2, \ldots, u_p; x)$ be the
integrand of (1.1). Then the log c.f. of the Poisson component is

$\psi(u) + u'\Gamma u/2 - iu'\mu = \int h(u_1, u_2, \ldots, u_p; x) \sum_{v=1}^{p} M_v(dx)$
$\quad = \sum_{v=1}^{p} \int h(0, \ldots, 0, u_v, 0, \ldots, 0; x)\, M_v(dx)$
$\quad = \sum_{v=1}^{p} \int h(0, \ldots, 0, u_v, 0, \ldots, 0; x)\, M(dx)$
$\quad = \sum_{v=1}^{p} \psi_v(u_v)$,

where $\psi_v(u_v)$ is the log c.f. of an inf.div. variate. Thus the
Poisson component of $X$ has independent variates.

(b) We have $X \cong X_G + X_P$, where $X_G$ and $X_P$ are independent.
It suffices to show that the variates of $X_G$ are independent and the
variates of $X_P$ are independent. Now, (3.1) gives independence of the
variates of $X_P$. Further, letting $X_G = (X_{G1}, \ldots, X_{Gp})'$ and
similarly for $X_P$, we have

$0 = \mathrm{Cov}(X_u, X_v) = \mathrm{Cov}(X_{Gu} + X_{Pu},\, X_{Gv} + X_{Pv})$,

which, as in the proof of Theorem 2.3, gives independence of $X_{Gu}$ and
$X_{Gv}$. Thus the variates of $X_G$ are uncorrelated and jointly normal,
and hence independent.

(c) We have
$\tau(X_u, X_v) = \mathrm{Cov}[(X_u - EX_u)^2, (X_v - EX_v)^2] = \nu(X_u, X_v) + \pi(X_u, X_v)$.
Further, $\pi(X_u, X_v) \ge 0$. Hence, $\tau = 0$ iff $\nu = 0$ and
$\pi = 0$, i.e., iff $\pi = 0$ and $\mathrm{Cov}(X_u, X_v) = 0$.

THEOREM 3.2. Let $X$ be an inf.div. r.vec. Then pairwise independence
of the elements of $X$ is equivalent to their mutual independence.

REMARKS. (i) The result in one direction is trivial: mutual
independence is stronger than pairwise independence. (ii) Now suppose the
variates are pairwise independent. Theorem 3.1 gives the result in the
case of finite fourth moments. The result is proved in general, without
the assumption of fourth moments, in [5]. In doing this, the Lévy
representation (see, e.g., [12, p. 160], for a discussion of the
univariate case) was used. It is of the form

$\psi(u) = iu'\mu - u'\Gamma u/2 + \int [\exp(iu'x) - 1 - iu'x/(1 + x'x)]\, L(dx)$

and exists even when the variances are not finite.

4.  SOME  REMARKS  ON  INFERENCE 

Consider the decomposition $X \cong X_G + X_P$ and the analogous
decomposition $T = N + \Pi$, where $T = [\tau(X_u, X_v)]$,
$N = [\nu(X_u, X_v)]$, $\Pi = [\pi(X_u, X_v)]$. In general, one must
consider estimates of both $N$ and $\Pi$. However, if $X$ is discrete it
has no Gaussian component and one can consider $\Pi$ alone. Use of
estimates of the parameters $\pi(X_u, X_v)$ in analyzing a set of data is
illustrated in [11]. A formula for an unbiased estimate of
$\pi(X_u, X_v)$, which is the cumulant of order (2,2), is given in a list
of bivariate k-statistics in [3, p. 329, expression (13.2)].

4.1.  Testing  Independence 

In [10] a test of independence of $X$ and $Y$, where $(X,Y)'$ is an
inf.div. r.vec., is given. The test is based on the asymptotic normality
of a sample analog of $\tau(X,Y)$. That is, it is based on the ratio of a
sample analog of this parameter to its asymptotic standard error. This
statistic is treated as a normal deviate. Similarly, in [10] a test of
independence of $X_1, X_2, \ldots, X_p$, assuming $X$ is inf.div., is
based on the asymptotic chi-square distribution of a suitable quadratic
form in the statistics $t_{uv}$, $u, v = 1, 2, \ldots, p$, $u < v$, where
$t_{uv}$ is a sample analog of $\tau(X_u, X_v)$.

4.2.  Testing  Normality 

In the continuous case one would want to test whether one could
restrict attention to the normal family, versus using the full inf.div.
class. It was noticed in [2] and later independently in [9] that nullity
of the fourth cumulant characterizes the normal distribution in the class
of inf.div. distributions. Similarly [9], nullity of the fourth
cumulants of a multivariate distribution characterizes it as multivariate
normal in the class of multivariate inf.div. distributions.

Multivariate goodness-of-fit problems seem to be rather difficult,
and the problem of testing multivariate normality has received a fair
amount of attention; see [3], [7], [8]. Sometimes such testing problems
are considerably simplified when the class of alternatives is reduced.
When the class is reduced to the inf.div. laws, still a large class, the
resulting testing problem is shown to be quite tractable, due to the
simple characterization of the normal family in the inf.div. class.

As remarked above, [10] shows how to test independence of the
variates of an inf.div. r.vec. The test given here can be used as a
preliminary test to decide whether one wants to use the full inf.div.
model or rather to rely on the normal model. In the normal model tests
of independence are of course based on the correlation coefficients, or,
equivalently, on the covariances. In the full inf.div. model, tests of
independence are based on the covariances of the squares of the centered
variables [10].

Given any random vector $X$, let $\mathcal{L}(X)$ denote the
probability law of $X$. Let $\mathcal{C}$ denote the class of inf.div.
laws with finite eighth moments (see below). Let $\mathcal{N}$ denote the
family of normal laws. We wish to test the hypothesis
$H$: $\mathcal{L}(X)$ is in $\mathcal{N}$ against alternatives that
$\mathcal{L}(X)$ is in $\mathcal{C} - \mathcal{N}$.

The hypothesis test. Recall that for a normal distribution
$\kappa_r = 0$ for $r \ge 3$. The hypothesis test is based on the
following characterization of multivariate normality in the inf.div.
class, due to [9].

THEOREM 4.1. Let $X$ be an inf.div. r.vec. with variates $X_v$,
$v = 1, 2, \ldots, p$, which have finite fourth moments. Then $X$ is
distributed according to a multivariate normal distribution iff
$\kappa_4(X_v) = 0$, for $v = 1, 2, \ldots, p$.

REMARK. Given a r.v. $Y$, let $\mu_r(Y)$ denote its $r$-th central
moment. Then $\kappa_4(Y) = \mu_4(Y) - 3[\mu_2(Y)]^2$, so that
$\kappa_4/\mu_2^2$ is a conventional measure of kurtosis [6, p. 88].

An unbiased estimate of $\kappa_4$ is the fourth k-statistic [6,
formulas (12.28) and (12.29), pp. 299-300], viz.,

$k_4 = [(n^3 + n^2)s_4 - 4(n^2 + n)s_3 s_1 - 3(n^2 - n)s_2^2 + 12n\, s_2 s_1^2 - 6 s_1^4]/n^{[4]}$
$\quad = n^2[(n+1)m_4 - 3(n-1)m_2^2]/(n-1)^{[3]}$,

where, for a univariate sample $y_1, y_2, \ldots, y_n$,

$s_r = \sum_{i=1}^{n} y_i^r$,  $m_r = \sum_{i=1}^{n} (y_i - \bar{y})^r/n$,  $\bar{y} = \sum_{i=1}^{n} y_i/n$,

and

$n^{[r]} = n(n-1)\cdots(n-r+1)$.

Let

$\kappa = (\kappa_4(X_1), \ldots, \kappa_4(X_p))'$.

The hypothesis $H$ is equivalent to $\kappa = 0$. Let
$k = (k_4(X_1), \ldots, k_4(X_p))'$. Let $\Sigma(k)$ denote the
covariance matrix of $k$. Then, by the asymptotic joint normality of
functions of sample moments, the quadratic form
$(k - \kappa)'[\Sigma(k)]^{-1}(k - \kappa)$ is asymptotically distributed
according to the chi-square distribution with $p$ degrees of freedom
(d.f.). Under the hypothesis the quadratic form $k'[\Sigma(k)]^{-1}k$ is
asymptotically distributed according to chi-square with $p$ d.f. Further,
if $\hat{\Sigma}(k)$ is a consistent estimate of $\Sigma(k)$, then the
quadratic form $Q = k'[\hat{\Sigma}(k)]^{-1}k$ is under $H$ also
asymptotically distributed as chi-square with $p$ d.f. To construct a
test it suffices that $\hat{\Sigma}(k)$ be consistent for $\Sigma(k)$
under the hypothesis.
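
The fourth k-statistic defined above can be computed in either of its two forms. The following is a minimal sketch; the function names `k4_power_sums` and `k4_central` are hypothetical, and the small sample is made up purely to compare the two expressions.

```python
import numpy as np

def k4_power_sums(y):
    """Fourth k-statistic from the power sums s_r = sum(y_i**r)."""
    y = np.asarray(y, dtype=float)
    n = y.size
    s1, s2, s3, s4 = (np.sum(y**r) for r in (1, 2, 3, 4))
    num = ((n**3 + n**2) * s4 - 4 * (n**2 + n) * s3 * s1
           - 3 * (n**2 - n) * s2**2 + 12 * n * s2 * s1**2 - 6 * s1**4)
    return num / (n * (n - 1) * (n - 2) * (n - 3))

def k4_central(y):
    """Fourth k-statistic from the central moments m_2 and m_4
    (both computed with divisor n)."""
    y = np.asarray(y, dtype=float)
    n = y.size
    d = y - y.mean()
    m2, m4 = np.mean(d**2), np.mean(d**4)
    return n**2 * ((n + 1) * m4 - 3 * (n - 1) * m2**2) / ((n - 1) * (n - 2) * (n - 3))

# Made-up sample of size 4 (the smallest size for which k4 is defined).
toy = [0.0, 0.0, 0.0, 1.0]
k4_a, k4_b = k4_power_sums(toy), k4_central(toy)
```

The central-moment form is usually preferable numerically, since the power-sum form subtracts large quantities when the data are far from zero.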

The covariance of $k_4(X)$ and $k_4(Y)$ is obtainable from [6, formula
(13.58), p. 343], as

$\mathrm{Cov}[k_4(X), k_4(Y)] = \kappa_{44}/n + \cdots + (16\kappa_{13}\kappa_{31} + 18\kappa_{22}^2)/(n-1)$
$\quad + 144n\,\kappa_{11}\kappa_{21}\kappa_{12}/(n-1)^{[2]} + 24n(n+1)\kappa_{11}^4/(n-1)^{[3]}$.

For normal distributions $\kappa_{rs} = 0$ if $r + s \ge 3$, so the
covariance above reduces to

$24n(n+1)\kappa_{11}^4/[(n-1)(n-2)(n-3)]$;

also,

$\mathrm{Var}(k_4) = 24n(n+1)\kappa_2^4/[(n-1)(n-2)(n-3)]$.

Thus we can take the $(u,v)$-th element of $\hat{\Sigma}(k)$ to be

$24n(n+1)\,s_{uv}^4/[(n-1)(n-2)(n-3)]$,

where $s_{uv}$ is the usual sample covariance,

$s_{uv} = \sum_{i=1}^{n} (X_{ui} - \bar{X}_u)(X_{vi} - \bar{X}_v)/(n-1)$,  $u, v = 1, 2, \ldots, p$,

where $X_i = (X_{1i}, \ldots, X_{pi})'$, $i = 1, 2, \ldots, n$, and
$\bar{X}_u = \sum_{i=1}^{n} X_{ui}/n$.

At level $\alpha$ one rejects the hypothesis of normality if $Q$ exceeds
the upper $\alpha$-th percentage point of the chi-square distribution
with $p$ d.f.
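
The test just described can be sketched end to end as follows. This is an illustration under assumptions rather than the author's program: the names `k4`, `chi2_sf` and `normality_test` are hypothetical, the chi-square tail is computed with a plain series for the incomplete gamma function in place of a library routine, and the simulated normal data serve only to exercise the code.

```python
import math
import numpy as np

def k4(y):
    """Fourth k-statistic (central-moment form) of a univariate sample."""
    y = np.asarray(y, dtype=float)
    n = y.size
    d = y - y.mean()
    m2, m4 = np.mean(d**2), np.mean(d**4)
    return n**2 * ((n + 1) * m4 - 3 * (n - 1) * m2**2) / ((n - 1) * (n - 2) * (n - 3))

def chi2_sf(x, df):
    """Upper tail of chi-square(df) at x, via the series for the
    regularized lower incomplete gamma function."""
    if x <= 0.0:
        return 1.0
    a, h = 0.5 * df, 0.5 * x
    lower, j = 0.0, 0
    while True:
        term = math.exp(-h + (a + j) * math.log(h) - math.lgamma(a + j + 1.0))
        lower += term
        if a + j > h and term < 1e-17:   # past the mode and negligible
            break
        j += 1
    return max(0.0, 1.0 - lower)

def normality_test(data):
    """data: (n, p) array of n observations on p variables, n >= 4.
    Returns (Q, p_value) for H: normal, against inf.div. alternatives."""
    data = np.asarray(data, dtype=float)
    n, p = data.shape
    k = np.array([k4(data[:, v]) for v in range(p)])
    s = np.cov(data, rowvar=False, ddof=1).reshape(p, p)   # sample covariances s_uv
    c = 24.0 * n * (n + 1) / ((n - 1) * (n - 2) * (n - 3))
    sigma_hat = c * s**4                                   # elementwise fourth power
    q = float(k @ np.linalg.solve(sigma_hat, k))
    return q, chi2_sf(q, p)

rng = np.random.default_rng(0)
q_norm, p_norm = normality_test(rng.normal(size=(500, 3)))
```

One would reject at level alpha when the returned p-value falls below alpha. The matrix of fourth powers of covariances is an elementwise (Hadamard) power of the sample covariance matrix, so it stays positive definite whenever the sample covariance matrix is.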


Example. The procedure is illustrated for the Fisher Iris data.
Fifty observations were taken from each of the populations, Iris setosa,
versicolor, and virginica. The variables are $x_1$ = sepal length,
$x_2$ = sepal width, $x_3$ = petal length, and $x_4$ = petal width. The
table gives the values of the coefficient of kurtosis,
$b_2 = k_4/m_2^2$, where $k_4$ and $m_2$ are as defined above. The
variance of $b_2$ for a normal parent is approximately $24/n$. Since here
$n = 50$, the standard error of any one $b_2$ in the table is
approximately 0.693. One could use this to test the significance of any
one of the twelve $b_2$'s given in the table, but then one would run into
the multiple-comparisons problem. The multivariate test avoids this
problem. The chi-square values in the table are values of quadratic forms
in the individual $k_4$'s for the four separate variables and hence may
be considered as quadratic forms in the $b_2$'s for the four separate
variables. For Iris setosa the chi-square value is almost significant at
the 5% level. For the other two species, the values are not significant.
Adding the three independent chi-squares, we have
9.390 + 1.167 + 1.770 = 12.327, with 12 d.f.; P = .43.
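
The tail probabilities quoted in the example can be recomputed from the chi-square values alone. The sketch below uses the closed form of the chi-square upper tail for an even number of d.f.; the function name `chi2_sf_even` is hypothetical, and the inputs are simply the chi-square values reported for the three species.

```python
import math

def chi2_sf_even(x, df):
    """Upper tail of chi-square with even d.f.:
    exp(-x/2) * sum_{j < df/2} (x/2)**j / j!."""
    if df <= 0 or df % 2:
        raise ValueError("this closed form needs an even, positive d.f.")
    h = x / 2.0
    return math.exp(-h) * sum(h**j / math.factorial(j) for j in range(df // 2))

# Chi-square values as reported for the three species (4 d.f. each).
species_q = {"setosa": 9.390, "versicolor": 1.167, "virginica": 1.770}
p_values = {name: chi2_sf_even(q, 4) for name, q in species_q.items()}

pooled_q = sum(species_q.values())        # sum of independent chi-squares
pooled_p = chi2_sf_even(pooled_q, 12)     # 12 d.f. in total

se_b2 = math.sqrt(24 / 50)                # approximate s.e. of b2 for n = 50
```

The same closed form does not apply to odd d.f., where a general incomplete-gamma routine would be needed instead.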

COEFFICIENTS OF KURTOSIS
FOR THE FOUR VARIABLES OF THE FISHER IRIS DATA,
WITH VALUES OF CHI-SQUARE FOR TESTING NORMALITY
AGAINST INFINITELY DIVISIBLE ALTERNATIVES

                 Iris setosa    Iris versicolor    Iris virginica

Values of coefficients of kurtosis:

  Variable 1       -0.263           -0.555              0.034
  Variable 2        0.994           -0.381              0.735
  Variable 3        1.064            0.050             -0.160
  Variable 4        1.790           -0.427             -0.627

Values of chi-square test statistic (4 d.f.):

                    9.390            1.167              1.770
                  (P = .052)       (P = .88)          (P = .78)